(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
10 October 2002 (10.10.2002) 




PCT 



liliBiiillllllllliiiiilililli 

(10) International Publication Number 

wo 02/080025 A2 



(51) International Patent Classification^: G06F 17/30 (81) Designated States (national): AB, AG, AU AM, AT, AU, 



(21) International Application Number: PCT/GB02/0n07 

(22) International FUing Date: 12 March 2002 (12.03.2002) 
(25) Filing Language: English 



(26) Publication Language: 



English 



(30) Priority Data: 
01302892.3 



28 March 2001 (28.03.2001) EP 



(71) Applicant ffor all designated States except US): BRITISH 
TELECOMMUNICATIONS PUBLIC LIMITED 
COMPANY [GB/GB]; 81 Newgate Street, London ECl A 
7AJ (GB). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): TATESON, Jane, 
Elizabeth [GB/GB]; 145 High Street. Wickham Market, 
Suffolk IPO ORD (GB). TATESON, Richard, Edward 
[GB/GB]; 145 High Street, Wickham Market, Suffolk IP13 
ORD (GB). 



AZ, BA, BB. BG, BR, BY. BZ, CA, CH, CN, CO. CR. CU. 
CZ, DE, DK, DM, DZ. EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS. JP, KE, KG, KP, KR, KZ, LC, 
LK. LR, LS, LT, LU, LV, MA, MD. MG, MK. MN, MW, 
MX, MZ, NO, NZ, DM, PH. PL, PT, RO, RU, SD, SE. SG, 
SI, SK, SL, TJ, TM, TN. TR, TT, TZ, UA, UG, US, UZ, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM. 
KE. LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR). OAPI patent 
(BF, BJ, CF, CG. a, CM. GA. GN. GQ, GW. ML, MR, 
NE, SN, TD, TG). 

Published: 

— with declaration under Article J 7(2)(a); without abstract; 
title not checked by the International Searching Atahority 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette, 



(74) Agent: LIDBETTER, Timothy, Guy, Edwin; BT Group 
Legal Services, Intellectual Property Department, 8th floor, 
Holbom Centre, 120 Holbom, London ECIN 2TE (GB). 



< 

in 



00 

o 

Q (54) Title: DATA RETRIEVAL SYSTEM 
^ (57) Abstract: 



wo 02/080025 



PCT/GB02/01107 



1 

DATA RETREIVAL SYSTEM 

This invention relates to data retrieval systems, and in particular to systems 
for assisting users making a selection from a large range of available items. It has 
5 application in searchable databases in which there are a large number of variables to 
consider and the user needs freedom to search according to his own preferred 
criteria, but the database is too large, or the user's criteria too poorly defined, for a 
fully structured search to be possible. 

In searchable databases a searcher is generally forced to navigate along a 

10 branching decision 'tree' towards a destination that will hopefully be what he wants. 
This is a good method for searching towards a known objective. However, because 
paths must be retraced to arrive at different destinations such a system is not so 
good for less structured searching ("browsing") where the objective is less clearly 
defined, or where several objectives may need to be inspected. The searcher is 

15 entirely at the mercy of the database's categorisation and will be unlikely to make 
chance finds, or form a general impression of what is available and thus direct his 
choices (a common strategy when shopping for clothes for example). 

The shortcomings of online searching are magnified still further when the 
bandwidth of the link between the user and the database is low. An attempt to 

20 'browse' an online database via a modem, for example, typically consists of a pause 
while the homepage loads, a relatively rapid selection by the searcher of a section 
within the database, another pause while the section page loads, rapid selection of a 
category of items, a further pause, etc. etc. Mobile access to the internet will mean 
that relatively low bandwidth online searching is likely to continue to grow even as 

25 people adopt high bandwidth connections for their fixed links. 

According to the invention there is provided an apparatus for selecting items 
from a database for display, comprising a data-storage means for storing, for each 
item, data indicative of the degree of similarity between that item and other items in 
the database; 

30 input means for receiving a user input identifying a first item in said database 

evolution processor means for specifying an evolved specification having a 
predetermined degree of similarity to the first item. 
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identifying from the database one or more variant items meeting the evolved 
specification, 

selection means for selecting a second item from amongst the variant items 
and output means for displaying an output identifying the selected second 

5 item. 

The invention also extends to a method of selecting items from a database 
for display, comprising the steps of: 

generating data indicative of the similarity between each item and other 
items in the database; 
10 receiving an input Identifying a first item in said database; 

generating an evolved specification, identifying a predetermined degree of 
similarity to the first item, 

selecting an item in the database meeting the evolved specification, 
displaying the selected second item. 
15 The invention also extends to a computer program for performing the 

method of the invention, and to a computer program product directly loadable into 
the internal memory of a computer, comprising software code portions for 
performing the steps of the method when the product is run on a computer. 

The invention also extends to a computer program product stored on a 
20 computer usable medium, comprising: 

computer-readable program means for causing a computer to generate data 
indicative of the similarity between each item and other items in a database 

computer-readable program means for causing the computer to receive an 
input identifying a first item in said database 
25 computer-readable program means for generating an evolved specification, 

specifying a predetermined degree of similarity to the first item, 

computer-readable program means for causing the computer to select an 
item in the database meeting the evolved specification, 

computer-readable program means for causing the computer to generate a 
30 display of the selected second item. 

Preferably a plurality of items are displayed, and the input means has means 
for selecting one of the displayed items to be the first item. In a preferred arrangment 
the output means has means for controlling the display means to replace one of the 
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plurality of items, preferably the item which has been displayed for longest without 
having been selected, by the selected second item. This can be an iterative process 
wherein the selected second item is added to the display, a further input being 
received relating to one of the displayed items, for selection of a further item for 
5 display. In order to identify which item is to be replaced, each displayed item may be 
allocated an age value, the age value being initially set at zero and Incremented 
periodically, the age value of a given displayed item being reset to zero if that item is 
selected, and the age values being used to identify items for deletion from the 
display. Selection of the second item may be constrained to prevent selection of the 

10 same item within a predetermined number of iterations of the process. 

The process, when allowed to repeat itself iteratively, allows the product 
selection process to perform an evolutionary search strategy with the user acting as 
the selective pressure, using "mutations" based on the most recent selection or 
selections. Such a process can create a serendipitous exploration of 'search space', 

1 5 more akin to the natural browsing process used in a shop or library. 

Existing evolutionary search strategies can be thought of as optimisation 
processes where the goal is to find the best set of values for the n dimensions of the 
search space. For example if n = 1, i.e. there is a single dimension V, the goal is to 
find the value of x for which some function f(x) is maximal. A number of 'individuals' 

20 are created with different values for x. The individuals that have high values for fix) 
are 'rewarded' with more "children" than individuals giving low values for f(x). The 
children of an individual have values for "x" which are similar to, but not identical to, 
the value of x for that individual. All the children are then evaluated according to their 
values for f(x), and rewarded with children accordingly. This process iterates until 

25 some termination condition is met. 

Continuously or discretely varying dimensions can be searched in this way, 
provided the 'mutation' operators which dictate how a child's value may vary from 
its parent's are constructed appropriately. For example, if x can take integer values 
from 1 to 100, an obvious mutation operator would be to add or subtract 1 from the 

30 parent's value to give the child's value (with suitable 'boundary conditions' to avoid 
less than 1 or more than 100). If x can take any real value from 1 to 100, the 
mutation operator might be to perturb the child's value according to some probability 
distribution around the parent's value (with boundary conditions again). 
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In the invention the goal is to find the best item in a database. This is an 
inherently discrete search - as the number of items in the database is finite. It 
should be noted that in the context of browsing the criteria defining "best"' are not 
defined initially. 

5 In one version of the invention these items are described according to n 

attributes, and those attributes are the n dimensions for an evolutionary search. Each 
item may be allocated specified values for each of the set of attributes, (the degree 
of similarity between any two items being identifiable by the number of attributes for 
which the two items have values in common). Items are displayed to the user, and 

10 rather than evaluating each item according to some objective function, the 
'optimality' or 'fitness' of each item is determined by rewards from the subjective 
user. The more rewarded items have a greater chance of contributing to the next 
generation. As in evolutionary search, a child is generated by 'mutating' the 
attributes of the parent. The evolved specification is therefore selected by generating 

15 a set of attribute values differing by a predetermined degree from the set of attribute 
values of the first item, the second item being selected from the database from those 
having attribute values corresponding to the generated set of attribute values. 
However, it is possible that the n dimensional search space will not precisely match 
the database. There may be some values of the n attributes, which do not 

20 correspond to any item in the database. Conversely for some sets of n values there 
may be more than one item. Therefore we could create a notional 'child', which does 
not actually exist in the database, or a child that corresponds to several real items in 
the database. The process of choosing the next item to display allows for both these 
possibilities. 

25 The evolved specification may be determined according to the attributes of 

two or more previous inputs, and to this end the apparatus may comprise recording 
means for recording the selections made on each cycle of operation of the apparatus, 
wherein the attribute generation means is arranged to generate a set of attributes 
related to attributes of items recorded as having been selected within a 

30 predetermined number of previous cycles of operation of the apparatus 

A special attribute set may be defined for each item, each attribute of the set 
representing, not properties such as colour, etc, but the degree of similarity between 
that item and some other item. For each item in the search space, there would be 



wo 02/080025 



PCT/GB02/01107 



5 

one of these attributes relating to each other item in the search space. Each specified 
value may simply define the presence or absence of an association between the two 
items. In this embodiment the search space may be considered to be organised as a 
connected graph. In other words, rather than using the same set of attributes to 
5 organise (or 'create') search space, and to navigate it, navigation of the space 
follows a series of connected points. The evolved specification can then be 
generated by specifying a predetermined value for the term of each attribute set 
relating to the first item. 

In order to achieve this the system operator of the search space must first 

10 determine what items should be "'adjacent" to each other. This may be done by a 
human operator, or in a semi-automated process in which every item is given values 
for a set of attributes and then placed in a search space that has the same number of 
dimensions as there are attributes. Every item then has a set of neighbours, defined 
as being those items located in this space within a specified distance. Again, this 

15 distance may reduce on successive iterations of the process. Navigation is carried out 
by moving from the selected items to one of its set of neighbours. This constitutes a 
"mutation". 

In a further embodiment the space is once again organised as a connected 
graph, but rather than allowing navigation to follow a series of connected points, the 

20 next item to display may be taken from anywhere in the database. Each attribute 
may be associated with a weighting value, such that on receipt of an input relating to 
a first item, the weighting values of the attributes associated with the first item are 
increased, and the evolved specification is generated such that items having 
attributes allocated higher weightings have a greater probability of selection than 

25 those with lower-weighted attributes. Clicking on an item rewards all items 
connected to that item. When choosing the next item to display, this method biases 
the choice according to the number of rewards accumulated by items over the course 
of the search. 

The generated set of attribute values may be determined according to the 
30 attributes of two or more previous inputs, and the selections made on each cycle of 
operation of the apparatus may be recorded, and the selection of the second item 
constrained to prevent selection of an item recorded by the recording means as 
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having already been selected within a predetermined nunr^ber of previous cycles of 
operation of the apparatus. 

Although the embodiments to be described are used for on-line retail 
shopping, and specifically for clothes, many other applications are possible. For 
5 example, the invention may be used for fashion material 'buyers' to browse towards 
colours, patterns, textures they like. On-line browsing is also particularly suited to 
fields of estate agency (real estate) and travel agency, where the products on sale 
are inherently difficult to display, and auction houses, which have big catalogues of 
Items that, because they are unique, cannot readily be physically displayed to a wide 
10 audience. The invention may also be used for selecting other items from a large 
database, such as "clip-art" images for incorporation in graphic displays such as 
presentation slides, many databases of which are difficult to browse because of the 
wide range of criteria under which they might be catalogued. The invention may also 
be used for on-line news feeds, arranging for pop-up windows with news information 
1 5 having content similar to items previously selected. The invention may also be applied 
to Identikit or e-fit systems for identifying criminals or missing persons, either by 
searching through a database of real people, or by generating a face from a witness's 
description. Rather than being on-line in the usual sense, the invention may be 
applied to an In-store Kiosk, for finding a desired Item using a terminal in a real shop 
20 before collecting it from 'goods out'. 

Embodiments of the invention will now be described, by way of example 
only and with reference to the drawings, in which: 

Figure 1 illustrates schematically the inter-relationships between the various 
elements that co-operate to perform the Invention; 
25 Figure 2 is a flow chart illustrating the process performed by a first 

embodiment of the invention; 

Figure- 3 is a flow chart illustrating the process performed by a second 
embodiment of the invention; 

Figures 4, 5, 6 and 7 illustrate displays that may appear during an illustrative 
30 run of the process. 

Figure 8 illustrates the database used to support the processes illustrated in 
Figure 4 to 7. 

Figure 9 illustrates in graphical form the data depicted in Figure 8. 
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Figure 10 illustrates the same data in a different graphical form. 
Figure 1 1 illustrates a further embodiment of the invention 
Figure 1 2 illustrates an additional step used in a multiple user variant of the 
embodiments of Figures 3 and 1 1 
5 Figure 1 illustrates a user terminal 10 connected through a communications 

network 1 1 such as a low-bandwidth telephone connection to a server 1 2. The 
server has access to a database 13, and itself comprises a number of subsystems, 
which will typically be implemented by software. These subsystems include a receive 
port 14, a session recording database 15, an evolution processor 16, a selection 
10 processor 17, and an output port 18. An order-processing server 19 is .also 
associated with the system. 

It should be understood that the distribution of the elements may be varied. 
For example a client server, interposed between the network 1 1 and main server 1 2, 
may perform some of the functions performed by the terminal 10 in the described 
15 embodiment. Alternatively, the process could be njn on the user terminal 10, 
accessing the data directly from an online database 13. 

In use, the system offers the searcher one or more search spaces or 
'gardens', which can either be held locally or by the service provider. These are the 
areas within which the searcher, over the course of one or many sessions, will 
20 cultivate a collection of items of interest to the user. The service provider creates a 
'search space' which is a multidimensional space consisting of all items available. A 
simplified example is shown in Figure 9. The neighbourhood of an item is populated 
with items that, in one or more characteristics, are similar to that item. 

The database 1 3 stores a catalogue of all the items available for inspection, 
25 classified by a large number of attributes. For example, clothes may be classified by 
type (shoes, hats, shirts, etc), colour, pattern, style, designer, price and so on. 

The database can be set up by manual entry or by extracting data from a 
catalogue. Items presented in spreadsheet form are particularly suited for compiling 
the database, using the various column entries as the categories. 
30 The process performed by a first embodiment of the invention is represented 

in Figure 2. A searching session operates as follows. The user of the terminal 10 
opens a search space or ''garden" (new or pre-existing) with a descriptor, which may 
be general (e.g. 'clothes') or more specific (e.g. "trousers"). Certain other limitations 
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may be added to limit the variety of items available for display: in particular the user 
may specify clothes sizes, to avoid the display of items not available in the user's 
own size. Subject to any such predetermined limitations, the selection processor 1 7 
selects items, initially at random, from the database 13 and passes them to the 
5 output port 18 for onward transmission to the user (step 20). To make the best use 
of the narrow bandwidth available on most home user's equipment, the output port 
18 includes a buffer store so that it can continuously provide the user terminal 10 
with items from the database 13. New items then start arriving in the display 
(description plus picture wherever appropriate). Initially these items are randomly 

10 selected from all items within the 'search space' shown in Figure 9, subject to any 
initial limitations imposed. When the user terminal 10 receives a new item it 
allocates it an "age" value, which is initially zero (step 21). This characteristic is 
incremented either in accordance with chronological time or when further items are 
added to the display, and items achieving a predetermined age are deleted from the 

15 garden. 

In the simplified illustrated examples shown in Figures 4 to 10, a number of items, 

identified by the characters A, B, C, D Z are available for display. These are 

stored on the database 13 each with a number of associated attributes. For the 
purposes of this example only three attribute categories are identified in Figure 9, 

20 namely garment type, colour and pattern. Several further such attributes are listed in 
Figure 8, although not used in this example. In practice many different attributes 
such as price, designer, age group, material, size, etc, would be used. Each item 
would then have an entry for an attribute in each of these categories (e.g. Designer 
Paul, Jacket, Grey, plain, adult, wool mixture, 96cm chest, £60). The attributes may 

25 be considered as defining a position in a multidimensional "search space" 90, in 
which items sharing an attribute would be adjacent to each other in the relevant 
dimension, as shown for the three illustrative attributes in Figure 9. (In Figure 9 the 
italicisation of items B, E, F, 0, P, U, W represents their location in a different plane 
from that containing the other items). 

30 Each item to be displayed is selected by choosing values for each category 

and then picking an item that matches all those values. Initially this selection is 
unbiased. For example, if there are eleven different designers and eight different 
garment types, there would be a 1 in 1 1 chance that 'Designer Paul' would be the 
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designer chosen for the first item to display and a 1 in 8 chance that the garment 
would be a jaclcet. The user can passively observe items entering the display as 
long as he likes. At any time the user may identify an item of interest to him. Such 
an item would be one that attracts the user as being of a kind worthy of further 
5 consideration, for example the item "J" in the display of Figure 4, 

When an item is selected the age value of that item is reset to zero (step 22) 
and a signal is transmitted over the communications link 1 1 (step 23) to the receive 
port 14, causing the product identifier to be stored in the session recording database 
15. The evolution processor 16 then applies an evolutionary search space technique 
10 to the data received. This is the point at which the arrival of new items deviates from 
a random sample. 

In the embodiment of Figure 2 the evolution processor 1 6 firstly retrieves the 
attributes of the selected item J stored in the database 13 (step 24). It then uses 
these attributes of the selected item to bias the random process of choosing the set 

15 of attributes for the next item to display. With this bias included, the choice of 
attributes Is then made and the resulting 'evolved' attributes are passed to the 
selection processor 17 (step 25). In a preferred arrangement the evolution processor 
16 uses the history of the last few selections retrieved from the session recording 
database 15, and not just the current selection, to determine which attributes to 

20 influence the biasing of attribute choice. This allows new selections to have more 
than one "parent". 
As an equation: 
Pel = (1 +nci ) / (Nc + ntotai) 
where: 

25 Pd is the probability that when choosing the next image to display, the value i will be 
chosen for category c 

nci is the number of times that the searcher has clicked on an item with the value i for 
its category c 

Nc is the number of different values of category c 
30 ntotai is the total number of clicks by the searcher in this session 

For example, if there are eleven different designers and eight different 
garment types, there would initially be a 1 in 11 chance that 'Designer Paul' would 
be the designer chosen for the first item to display and a 1 in 8 chance that the 



wo 02/080025 



PCT/GB02/01107 



10 

garment would be a jacket. If the searcher selects a Designer Paul jacket, then the 
weightings are modified such that the probability that the second item displayed 
would be another item by the same designer rises from 1/11 10.091) to 2/12 
(0.167), and the chance of getting another jacket would be weighted to rise from 1/8 
5 (0.125) to 2/9 (0.222). (It can easily be seen that the probability of getting another 
jacket from the same designer is still very small, at 1 1ll = 0.037, but this is greater 
than the random probability of 1/88 = 0.011. The increments to the probabilities of 
choosing particular category values continue to accumulate throughout the user 
session. Note that the initial weightings do not take account of the number of 

10 available items in each category - each designer has an equal chance of being 
selected initially, however many of his individual products appear in the database. 

Once the random selection process has decided on the category values for 
the next item, the database of items (13) is searched by the selection processor (17) 
to identify items that match those values. If more than one item satisfies these 

1 5 criteria, one of them is chosen at random with equal probability 

The session recording database 15 is consulted to ensure that items that 
have already been suggested are not repeated (step 27), with another selection 
having the same criteria being made if possible (step 26). If a predetermined number 
of attempts to select an item having these criteria fail (because all such items have 

20 previously been selected, or if there is no item in the database with the set of 
category values, a counter (system 271) times out and a new set of category values 
is generated (return to step 25). 

The selection processor 1 7 next passes the selected items to the output port 
18 for onward transmission. At the user terminal, each suggestion offered by the 

25 system is added to the display, displacing the Item having the greatest "age" (step 
28). 

This method is simple and computationally efficient, and can readily be 
extended to a multi-user situation as will be discussed. It also tends to focus the 
search rapidly because the percentage change of the probability resulting from a 
30 reward is largest when that value for the category has not been rewarded much 
before (change from 1/11 (0.091) to 2/12 (0.167) is bigger than, later in session, 
33/217 (0.152) to 34/218 (0.156). This might or might not be an advantage 



wo 02/080025 



PCT/GB02/01107 



11 

depending on what is a good mix of focus versus search. It would be possible to use 
a different function relating selections to probabilities if required. 

However, the search is biased towards showing the searcher items with 
unusual category value combinations: for example there might be several different 
5 Designer Paul grey jackets for adults but only one Designer Paul red trousers for 
adults. Furthermore, its efficiency will be adversely affected if the space of possible 
category value combinations is sparsely populated with actual items i.e. if most 
randomly generated sets of category values do not correspond to any Item in the 
database, resulting in the algorithm having to "re-roll the dice" many times before 

10 hitting on a combination of values which does match an item. 

For the purpose of the embodiment of Figure 3, links are defined between 
certain items, as shown in Figure 10. These links may relate to individual attributes 
by which the items are categorised, or may be determined empirically by research 
data indicative of searcher preferences. In practice, both methods of determining 

1 5 such associations may be used to define the links illustrated in Figure 10 

The processes of Figure 3 (steps 30, 31, 32, and 33) follow a similar 
procedure to that of Figure 2 (steps 20, 21, 22, 23) up to the point where the 
evolution of the search space departs from random, as the search strategy employed 
is different. In the embodiment of Figure 3 a predetermined neighbour list Is 

20 generated for each item as shown in the right hand column of Figure 8 and indicated 
by the links between items in Figure 10. It should be noted that a link could relate to 
any connection that may exist between the two items. For example market research 
data indicating that purchasers of a given item commonly also buy another item may 
be used to generate such a link between otherwise apparently unrelated items. The 

25 links may all be of unit value, in which case there is an equal probability of choosing 
any neighbour, or may take real values between 0 and 1, in which case the 
probability of choosing any neighbour is proportional to the value, or 'weighting', of 
the link. 

In this embodiment an item is selected from the neighbour list of the 
30 previously selected item (step 36), and is displayed (step 38). In this way the display 
is made to "evolve" towards a group of items that are all either selected by the user, 
or linked to such items. As in the embodiment of Figure 2, a check is made (step 37) 
to avoid duplication, and a further selection from the neighbour list made if possible 
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Step 371, step 36 repeated) or, if no such item is available, a random selection is 
made (step 361). The selection processor 17 next passes the items selected (in step 
36 or 361) to the output port 18 for onward transmission. At the user terminal, each 
suggestion offered by the system is added to the display, displacing the item having 
5 the greatest "age" (step 38). 

In a third embodiment, shown in figure 1 1 , the search space is organised 
using links between items as in the embodiment of figure 3. However, in this case 
the next item to display is chosen probabilistically from all items in the database, 
more like the embodiment of figure 2. The links are used to allow the 'reward' of 

10 clicking on one item to spread to neighbouring items and hence increase the 
probability that those items will be chosen for display by the biased random selector. 

On each iteration, one item is selected at random from the database (step 
46) for display. Except on the first iteration, when all items are equally likely to be 
displayed, the probabilities of individual items being selected for display are weighted 

15 according to the results of previous iterations. A check is first made (step 47) to 
ensure the item has not been displayed before (in which case a new selection is 
made), and the newly selected item is added to the display, displacing the oldest 
(step 48). The ages of the items on display are then incremented. The user may then 
select an item from the display (step 42). If such a selection is made, the weightings 

20 of each item in the database linked to the selected item are increased (step 45), so 
that on subsequent iterations the selection is biased towards items in which the user 
has previously shown interest. The user is also offered the option of buying the 
selected item (step 49) as in the other embodiments. 

Another item is then selected from the database (step 46), using the 

25 adjusted weightings. If after a predetermined interval no selection is made by the user 
(step 42) a selection is made based on the existing weightings (step 46) 

Taking the same example as the previous embodiment, in this case an M x M 
matrix is created where M is the total number of items. Each row corresponds to an 
item, and each entry in that row is a number indicating the strength of association 

30 between that item and another item. 

There is also a vector with M terms, which is updated as the searching 
session goes on. Each term pn of the vector represents the probability that the 
corresponding item n will be selected. Initially, all terms pn are set equal. 
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The next item to show is chosen randomly, taking into account the 
probability factors pn. 

The vector is updated in response to the searcher's clicks as follows: 
i) the searcher clicks on an item 
5 ii) the row in the M x M matrix corresponding to that item is found 

iii) the values of the terms in that row are added to the vector 

iv) the vector is normalised so that the sum of terms is equal to M 

This process gives a .fine granularity of relationships between items. It 
needn't be tied to categories (if we want to make a grey jacket by one designer 

10 highly associated with an olive shirt by a different designer, we can). The 
relationships need not be symmetrical (people who like the jacket could be shown the 
shirt but not vice versa). The rapidity with which this method focuses the search can 
be set by parameterising the function describing the update of the matrix. This 
allows the system operator or even the searcher to alter the 'exploitation versus 

1 5 exploration' of the algorithm. . 

To avoid loss of Information about a searcher's clicking behaviour, which 
would otherwise occur when the single updated vector Is used in this method, the 
history of clicks may be used to make inferences about the main driver(s) of the 
searcher's search (e.g. looking for red things). This may allow faster focus than 

20 relying on this information being implicitly recovered by the item-to-item links 
method. To carry out this variant, there are three steps 

Firstly, a set of 'hypothesis' vectors, each containing M binary terms are 
produced. A '1' in the Mth position means 'the Mth item conforms to this 
hypothesis'. For example, the hypothesis might be 'red items' and the vector would 

25 simply have a 1 in each position corresponding to a red item. 

Next, a history vector is maintained where the Mth term is incremented 
whenever the searcher clicks on item M 

The history vector is compared with the hypothesis vectors to infer which 
are the most likely explanations for user behaviour. 

30 This process allows the extraction of comprehensible information, such as a 

preference by a particular customer for the colour red. With enough data (a long 
single session, or many users' short sessions) it may allow the formulation of new 
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hypotheses, which also retain some explanatory meaning and hence might be useful 
to retailers. 

These embodiments may be developed in a number of ways to make use of 
correlations between users. Two approaches are discussed here, one linked to 
5 purchased items and another relying only on navigation behaviour. 

In the first, data is gathered from the searching sessions of many users over 
time. Information can be retrieved relating to the most popular purchased items and, 
for each item in the database. Information can also be stored relating to the number 
of times over the history of user sessions that selecting one specified item at some 
10 point in the session correspond to eventual purchase of some other specified item, 
(e.g. we record 1000 user sessions and find that there were fifteen occasions when 
a user who eventually bought the Designer Paul grey jacket had rewarded the 
Designer Peter green sweater. The sweater is now linked to the jacket with a 
weighting of 15). 

15 When a new session starts, this information may be used to preferentially 

display a 'top selling' item. This could be done at random throughout the session, or 
particularly when the user has not rewarded any items for a while. The top selling 
item to display is picked by looking at the history of selections during the current 
session. Each item selected will have a link of a certain strength (possibly zero) 

20 relating it to each of the top sellers. For each top seller the link strengths of rewarded 
items are summed. The item to display is then chosen from those top sellers with a 
probability proportional to the sum of link strengths. 

An alternative approach relies on the search space being navigated using item- 
to-item links and can be applied to the embodiments of figures 3 and 1 2. It uses the 

25 selection behaviour of users to directly alter values in the matrix of links between 
items. Rewarding an item C and then rewarding an item D leads to an increment in 
matrix value at (c,d). If it is desired that the matrix is symmetrical, it can of course 
also increment (d,c). If the next rewarded item is E, there will be an increment to 
(d,e). There may also be a lesser increment for (c.e). 

30 In Figure 1 2 we are to imagine there were selections of items A, B and C 

which preceded the selection of D so there has already been a sequence of four 
selections during the user session. 
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Figure 12 shows the result of the fifth rewerd i.e. clicking on item E. The 

primary incrementing function Fi is applied to the matrix value relating the new 

rewarded item (E) to the previous (D). Secondary, tertiary, etc. functions F2, F3, F4. 

etc increment links directly between items further down the chain and the newly 
5 selected item. Of course during the clicks on B, C and D there have already been 

increments to the links among A, B, C and D. 

The alterations made to the matrix during a user session would be 

incorporated into the centrally held matrix and would thus affect the behaviour of the 

system for subsequent users. 
10 In practical terms, it is anticipated that the functions applied to the matrix 

values would each result in small increments so that the behaviour of a single user 

does not grossly distort the search space. 

The system operator may use the same process to generate new links 

between items, or alter the strength of existing links. An administrator browses the 
15 search space looking for items that are to be linked. For example, if the administrator 

sees three red items on the screen, clicking on each will make (or strengthen) the 

links between them. 

The displays generated by the systems of any of the embodiments may 

appear similar to the user, and typical displays are shown in Figures 4 to 7. The 
20 "age'' value associated with each item is also shown in Figures 4 to 7, although this 

would not normally be displayed. From the display shown in Figure 4, the user 

selects item "J", and as previously described items X, K are added. These replace the 

items F, P with the highest "age" value, as shown by comparison of Figures 4 and 5. 

The age value of item "J" is re-set to zero. The process then continues: the user 
25 selecting, for example item "R" {Figure 5), and items "S" and "L", which each share 

an attribute with item "R'' (if running the process of Figure 2) or which are each on 

the neighbour list of item "R" (process of Figure 3) are displayed (Figure 6), replacing 

items "C" and "W" which now have the highest age values, and resetting the age 

value of item "R" to zero. Item "L" is next selected (Figure 6), causing the display of 
30 two further items "N" and "M" (Figure 7), replacing items "J" and "Y" which now 

have the highest age values. 

As the session proceeds, the display will be increasingly populated with 

items that have either been recently selected by the user, or are descended from 
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such items. The continued presence of selected items is achieved by the resetting of 
their ages to zero when they are selected or by moving top choices to a separate 
window on the display screen. The evolved garden therefore includes a mixture of 
items sharing some attributes with items the user found interesting, but probably 
5 including at least some things which were not in the searcher's mind when the 
session began. As each new item is selected in accordance with the previous few 
selections made by the user, the display will gradually evolve to show items likely to 
be of interest to the user. Suggestions that do not prompt the user to select them 
disappear from the system as their age value increments, and no similar suggestions 
10 are made. 

When the user sees an item he wishes to order, he selects the item as before 
but now transmits a signal 29 (39, 49) indicating he wishes to purchase it, which is 
directed to the order-processing server 19. 

As will be understood by those skilled in the art, any or all of the software 

15 used to implement the invention can be contained on various transmission and/or 
storage mediums, so that the program can be loaded onto one or more general 
purpose computers or could be downloaded over a computer network using a suitable 
transmission medium. The computer program may be embodied on any suitable 
carrier readable by a suitable computer input device, such as CD-ROM, optically 

20 readable marks, magnetic media, punched card or tape, or on an electromagnetic or 
optical signal. 
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CLAIMS 

1 . Apparatus for selecting items from a database for display, comprising a data- 
storage means for storing, for each item, data indicative of the degree of similarity 
5 between that item and other items in the database; 

input means for receiving a user input identifying a first item in said database 
evolution processor means for specifying an evolved specification having a 
predetermined degree of similarity to the first item, 

identifying from the database one or more variant items meeting the evolved 
10 specification, 

selection means for selecting a second item from amongst the variant items 
and output means for displaying an output identifying the selected second 

item. 

15 2. Apparatus according to claim 1, comprising display means for displaying a 
plurality of items, and wherein the Input means has means for selecting one of the 
displayed items to be the first item. 

3. Apparatus according to claim 2, wherein the output means has means for 
20 controlling the display means to replace one of the plurality of items by the selected 

second item. 

4. Apparatus according to claim 3, wherein the selected second item replaces 
the item which has been displayed for longest without having been selected. 

25 

5. Apparatus according to claim 4, further comprising means for generating a 
display of the selected items, means for allocating to each displayed Item an age 
value, the age value being initially set at zero, means for periodically incrementing the 
age value of each displayed item, means for re-setting the age value of a given 

30 displayed item to zero in response to an input received by the input means identifying 
that item, and means for deleting from the display items having an age value greater 
than a predetermined value. 
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6. Apparatus according to claim 1 , 2, 3, 4 or 5 wherein the data storage means 
comprises means for allocating to each item specified values for each of a set of 
attributes, the degree of similarity between any two Items being identified by the 
number of attributes for which the two items have values in common. 

5 

7. Apparatus according to claim 6, further comprising attribute evolution 
generation means for generating a set of attribute values differing by a predetermined 
degree from the set of attribute values of the first item, and selection means for 
selecting from the database a second item having attribute values corresponding to 

10 the generated set 

8. Apparatus according to claim 1, 2, 3, 4, or 5, wherein the data storage 
means comprises means for allocating to each item specified values for each of a set 
of attributes defining its degree of similarity to each other item 

15 

9. Apparatus according to claim 8, the evolution specification means comprising 
means for defining an evolved specification specifying a predetermined degree of 
similarity to the first item 

20 10. Apparatus according to claim 8 or 9, wherein each specified value defines 
the presence or absence of an association between the two items 

1 1 . Apparatus according to claim 7 or 8, wherein the generated set of attribute 
values is determined according to the attributes of two or more previous inputs. 

1 2. Apparatus according to claim 1 1 comprising recording means for recording 
the selections made on each cycle of operation of the apparatus, wherein the 
attribute generation means is arranged to generate a set of attributes related to 
attributes of items recorded as having been selected within a predetermined number 
of previous cycles of operation of the apparatus 



25 



30 
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13. Apparatus according to claim 6, 1, 8, 9, 10, or 11 comprising means for 
associating with each attribute a weighting value, and means for increasing the 
weighting values of the attributes associated with a first item on receipt of an input 
relating to the first item, and wherein the means for retrieving the second item is 

5 operable such that items having attributes allocated higher weightings have a greater 
probability of selection than those with lower-weighted attributes. 

14. Apparatus according to any preceding claim, comprising recording means for 
recording the selections made on each cycle of operation of the apparatus, and 

10 wherein the selection means is arranged to constrain the selection of the second item 
to prevent selection of an item recorded by the recording means as having already 
been selected within a predetermined number of previous cycles of operation of the 
apparatus. 

15 15. Method of selecting items from a database for display, comprising the steps 
of: 

generating data indicative of the similarity between each item and other 
items in the database; 

receiving an input identifying a first item in said database; 
20 generating an evolved specification, identifying a predetermined degree of 

similarity to the first item, 

selecting an item in the database meeting the evolved specification, 
displaying the selected second item. 

25 16. A method according to claim 15, wherein a plurality of items is displayed, 
the first item being one of the items so displayed. 

17. A method according to claim 16, being an iterative process wherein the 
selected second item is added to the display, a further input being received relating to 

30 one of the displayed items, for selection of a further item for display. 

18. Method according to claim 16 or 17 wherein one of the plurality of displayed 
items is replaced by the selected second item. 
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19. A method according to claim 18, wherein each displayed item is allocated an 
age value, the age value being initially set at zero and incremented periodically, the 
age value of a given displayed item being reset to zero if that item is selected, and 
the age values being used to identify items for deletion from the display. 

5 

20. Method according to claim 19, wherein the selected second item replaces 
the item that has been displayed for longest without having been selected. 

21. A method according to claim 18, 19, or 20 wherein the selection of the 
10 second item is constrained to prevent selection of the same item within a 

predetermined number of iterations of the process. 

22. A method according to any of claims 15 to 21, wherein each item has 
allocated specified values for each of a set of attributes, the degree of similarity 

15 between items being identified by the number of attributes for which they have 
values in common. 

23. A method according to claim 22, wherein the evolved specification is 
selected by generating a set of attribute values differing by a predetermined degree 

20 from the set of attribute values of the first item, and the second item is selected from 
the database from those having attribute values corresponding to the generated set 
of attribute values. 

24. A method according to claim 23, wherein the evolved specification is 
25 determined according to the attributes of two or more previous inputs. 

25. A method according to any of claims 15 to 21, wherein, for each item, an 
attribute set is defined, the terms of the attribute set representing the degree of 
similarity between that item and each other item. 

30 

26. Method according to claim 25, wherein the evolved specification is 
generated by specifying a predetermined value for the term of each attribute set 
relating to the first item. 
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27. Method according to claim 25 or 26, wherein each specified value defines 
the presence or absence of an association between the two items 

28. A method according to claim 21, 22, 23, 24 or 25, wherein each attribute is 
5 associated with a weighting value, and wherein on receipt of an input relating to a 

first item, the weighting values of the attributes associated with the first item are 
increased, and wherein the evolved specification is generated such that items having 
attributes allocated higher weightings have a greater probability of selection than 
those with tower-weighted attributes. 

10 

29. A computer program for performing the steps of any of claims 15 to 28. 

30. A computer program product directly loadable into the internal memory of a 
computer, comprising software code portions for performing the steps of any of 

1 5 claims 1 5 to 28 when said product is run on a computer. 

31. A computer program product stored on a computer usable medium, 
comprising: 

computer-readable program means for causing a computer to generate data 
20 indicative of the similarity between each item and other items in a database 

computer-readable program means for causing the computer to receive an 
input identifying a first item in said database 

computer-readable program means for generating an evolved specification, 
specifying a predetermined degree of similarity to the first item, 
25 computer-readable program means for causing the computer to select an 

Item in the database meeting the evolved specification, 

computer-readable program means for causing the computer to generate a 
display of the selected second item. 



wo 02/080025 



PCT/GB02/01107 



1/12 




wo 02/080025 



PCT/GB02/01107 



2/12 



20 



Server Provides 
Random Selection 



21 



Set Age = 0 



New Item 
Displaces 
Old One 



11 



J 



23 



User Selects Item: 
Reset Age = 0 



Transmit Request 




Yes 



OrderingYesX \ ] 
procST ^ ^"^^ 




29 



Buy? 
^No 



Retrieve Attributes 



24 



Select Items 
With Evolved 
Attributes 



ncremen; 
counter 



Evolve New 
Attributes 




Yes 



wo 02/080025 



PCT/GB02/01107 



3/12 



30 



Server Provides Initial 
Random Selection 



Set Age = 0 



31 



32 



User Selects Item: 
Reset Age = 0 



33 



Ordering 
pro c i lj 



en 



New Item 
Displaces 
Old One 





Server 




Provides 


< — 


Single 




Random 




Item 



ncremen 
counter 





Select 
Related 
Item 



No 



PL, 



36 



wo 02/080025 



PCT/GB02/01107 



4/12 




• 1-H 



wo 02/080025 



PCT/GB02/01107 



5/12 




PL, 



wo 02/080025 



PCT/GB02/01107 




wo 02/080025 



PCT/GB02/01107 



7/12 







(0)N 


4444 
4444 


4444444^^^4^^ 
44444^4^^^4^4 

^'^i44^4^4^^4^^ 
^^♦4^4^44^ 
^^♦4^4^44^ 
^^y^444^^^4^^^ 
^44444^444^>^ 
4444^#^^^^^^^^ 


4444 
4444 
4444 





4^44 
4^44 

4444 


1— 


4444444^^44^^^ 
44^^44^^^^4^^^ 
4^^4444^^^^44 
4^4^4^^^^^^^4< 
4^4444^^^^4^4^^ 
444444^^^^4^^^ 
4^^^444^^^4^^< 
#4444^^^^^4^^^ 


4^^4 
4^^4 

(►♦♦4 





wo 02/080025 



PCT/GB02/01107 



8/12 



Neighbours 


BGW 


AIJ 


CD 

UJ 


2 

UJ 


CDP 


CD 


u. 

O 

< 


KN 


U- 
CQ 


BX 


X 

—} 


MNRZ 


-J 

Q 


X 
-J 




UJ 

O 


Q. 


LS 




>- 
> 


X 
> 


TU 


AOY 


N 

—» 
Z) 


Ml 


LX 


Designer 






















































Price 






















































Colour 


Black 


White 


Black 




White 


White 


Black 




White 












White 


White 


Black 






Black 


White 


Black 


White 




Black 




Pattern 


Plain 


Plain 


Plain 


Stripes 


Plain 


Plain 


Plain 


Dots 


Plain 


Stripes 


Stripes 


Dots 


Check 


Dots 


Plain 


Plain 


Plain 


Dots 


Stripes 


Plain 


Plain 


Plain 


Plain 


Stripes 


Plain 


Check 


Type 


Coat 


Coat 


Hat 


Hat 


Hat 


Lingerie 


Lingerie 


Lingerie 


Lingerie 


Shirt 


Shirt 


Shirt 


Shirt 


Shirt 


Shirt 


Shoes 


Shoes 


Skirt 


Skirt 


Skirt 


Skirt 


Skirt 


Trousers 


Trousers 


Trousers 


Trousers 


Item 1 


< 


CD 


o 


Q 


UJ 


u. 




I 








_i 




z 


O 


OL 


O 


a: 


CO 


H 


3 


> 


5 


X 


>- 


N 



00 




wo 02/080025 



PCT/GB02/01107 




wo 02/080025 



PCT/GB02/01107 



10/12 




wo 02/080025 



PCT/GB02/01107 




wo 02/080025 



PCT/GB02/01107 



12/12 




PATENT COOPERATION TREATY 

PCT 

DECLARATION OF NON-ESTABLISHMENT OF INTERNATIONAL SEARCH REPORT 
(PCT Article 17(2)(a), Rules 13ler.1(c) and Rule 39) 



Applicant's or agent's file reference 

A26126 WO 


IMPORTANT DECLARATION 


Date of mdJSmg(day/month/year) 

07/06/2002 


International application No. 

PCT/GB 02/01107 


International filing 69iie(clay/month^year) 
12/03/2002 


(Earliest) Priority 6BtB(day/month/^r) 
28/03/2001 


International Patent Classification (IPC) or both national classification and IPC 


G08F17/60 


Applicant 

BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY 



international search report will 



This International Searching Authority hereby declares, 'according to Article 17(2}(a), that no 
be established on the International applicaUon for the reasons indrcated below 

1 . [X] The subject matter of the international application relates to: 

a. I 1 scientific theories. 

b. I I mathematical theories 

c. I I plant varieties. 

d. I I animal varieties. 

e. rn essentially biological processes for the production of plants and animals, other than microbiological processes 
— and the products of such processes. 

f. g] schemes, rules or methods of doing business. 

g. schemes, wles or methods of performing purely mental acts. 

h. schemes, niles or methods off playing games. 

I. Q methods for treatment of the human body by surgeiy or therapy, 
j. Q methods for treatment of the animal body by surgery or therapy, 
k. Q diagnostic methods practised on the human or animal body. 
I Q mere presentations off Information. 

m. □ computer programs for which this International Searching Authority Is not equipped to search prior art 

2 n The failure of the following parts of the international application to comply with prescribed requirements prevents a 
' ^ meaningful search from being carried out: 

n the drawings 



fn the description 



\~] the claims 



3 n The failure of the nucleotide and/or amino acid sequence listing to comply with the standard provided for in Annex C of the 
' Administrative Instructions prevents a meaningful search from being carried out: 

Q the written fbmi has not been ffurnished or does not comply with the standard. 

I I the computer readable form has not been furnished or does not comply with the standard. 
4. Further comments: 



Name and mailing address of the International Searching Authority 

^ European Patent Office, P.B. 5818 Patentiaan 2 

^Xft NL-2280 HV Rijswijk 
QJji Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



ML Rodriguez N6voa 



Form PCT/ISA/203 (July 1998) 



International Application No. PCT/GB 02/01107 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 203 



The claims relate to subject matter for which no search is required 
according to Rule 39 PCI. Given that the claims are formulated in terms 
of such subject matter or merely specify comnonplace features relating to 
its technological implementation, the search examiner could not establish 
any technical problem which might potentially have required an inventive 
step to overcome. Hence it was not possible to carry out a meaningful 
search into the state of the art (Art. 17(2)(a)(i) and (ii) PCT; see 
Guidelines Part B Chapter VIII, 1-6). 

The applicant's attention is drawn to the fact that claims relating to 
inventions in respect of which no international search report has been 
established need not be the subject of an international preliminary 
examination (Rule 66.1(e) PCT). The applicant is advised that the ERG 
policy when acting as an International Preliminary Examining Authority is 
normally not to carry out a preliminary examination on matter which has 
not been searched. This is the case irrespective of whether or not the 
claims are amended following receipt of the search report or during any 
Chapter II procedure. If the application proceeds into the regional phase 
before the EPO, the applicant is reminded that a search may be carried 
out during examination before the EPO (see EPO Guideline C-VI, 8.5) » 
should the problems which led to the Article 17(2) declaration be 
overcome. 



